Collaborative annotation of genes and proteins between UniProtKB/Swiss-Prot and dictyBase

Authors

  • Pascale Gaudet
  • Lydie Lane
  • Petra Fey
  • Alan Bridge
  • Sylvain Poux
  • Andrea H. Auchincloss
  • Kristian B. Axelsen
  • Silvia Braconi-Quintaje
  • Emmanuel Boutet
  • P. Brown
  • Elisabeth Coudert
  • Ruchira S. Datta
  • W. C. de Lima
  • Tulio de Oliveira
  • Severine Duvaud
  • N. Farriol-Mathis
  • Serenella Ferro
  • Marc Feuermann
  • Alain Gateau
  • Ursula Hinz
  • Chantal Hulo
  • Janet James
  • Silvia Jimenez
  • Florence Jungo
  • Guillaume Keller
  • Philippe Lemercier
  • Damien Lieberherr
  • Madelaine Moinat
  • Anastasia N. Nikolskaya
  • Ivo Pedruzzi
  • Catherine Rivoire
  • Bernd Roechert
  • Michel Schneider
  • E. Stanley
  • Michael Tognolli
  • Kimmen Sjölander
  • Lydie Bougueleret
  • Rex L. Chisholm
  • Amos Bairoch
Abstract

UniProtKB/Swiss-Prot, a curated protein database, and dictyBase, the Model Organism Database for Dictyostelium discoideum, have established a collaboration to improve data sharing. One of the major steps in this effort was the 'Dicty annotation marathon', a week-long exercise with 30 annotators aimed at achieving a major increase in the number of D. discoideum proteins represented in UniProtKB/Swiss-Prot. The marathon led to the annotation of over 1000 D. discoideum proteins in UniProtKB/Swiss-Prot. Concomitantly, there were a large number of updates in dictyBase concerning gene symbols, protein names and gene models. This exercise demonstrates how UniProtKB/Swiss-Prot can work in very close cooperation with model organism databases and how the annotation of proteins can be accelerated through those collaborations.

Related resources

UniSave: the UniProtKB Sequence/Annotation Version database

SUMMARY The UniProtKB Sequence/Annotation Version database (UniSave) is a comprehensive archive of UniProtKB/Swiss-Prot and UniProtKB/TrEMBL entry versions. All changed Swiss-Prot and TrEMBL entries are loaded into the UniSave as part of the public bi-weekly UniProtKB releases. Unlike the UniProtKB, which contains only the latest Swiss-Prot and TrEMBL entry versions, the UniSave provides access...


Last rolls of the yoyo: Assessing the human canonical protein count

In 2004, when the protein estimate from the finished human genome was only 24,000, the surprise was compounded as reviewed estimates fell to 19,000 by 2014. However, variability in the total canonical protein counts (i.e. excluding alternative splice forms) of open reading frames (ORFs) in different annotation portals persists. This work assesses these differences and possible causes. A 16-year...


HAMAP: a database of completely sequenced microbial proteome sets and manually curated microbial protein families in UniProtKB/Swiss-Prot

The growth in the number of completely sequenced microbial genomes (bacterial and archaeal) has generated a need for a procedure that provides UniProtKB/Swiss-Prot-quality annotation to as many protein sequences as possible. We have devised a semi-automated system, HAMAP (High-quality Automated and Manual Annotation of microbial Proteomes), that uses manually built annotation templates for prot...


UniProt-DAAC: domain architecture alignment and classification, a new method for automatic functional annotation in UniProtKB

MOTIVATION Similarity-based methods have been widely used in order to infer the properties of genes and gene products containing little or no experimental annotation. New approaches that overcome the limitations of methods that rely solely upon sequence similarity are attracting increased attention. One of these novel approaches is to use the organization of the structural domains in proteins. ...


Journal title:

Volume 2009, Issue

Pages -

Publication date: 2009